Detecting Harmful Web Documents Based on Web Document Analyses
نویسندگان
چکیده
منابع مشابه
RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کاملWeb Document Clustering based on Document Structure
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To achieve more accurate document clustering, document structure should be reflected in the underlying data model. This paper presents a framework for web document clustering based on two important concepts. The first one is the web document structure, which is currently ...
متن کاملAn Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
متن کامل2-Way Text Classification for Harmful Web Documents
The openness of the Web allows any user to access almost any type of information. However, some information, such as adult content, is not appropriate for all users, notably children. Additionally for adults, some contents included in abnormal porn sites can do ordinary people’s mental health harm. In this paper, we propose an efficient 2-way text filter for blocking harmful web documents and a...
متن کاملRecommending Web Documents Based on User Preferences
Eric J. Glover1;2 Steve Lawrence1 Michael D. Gordon3 William P. Birmingham2 C. Lee Giles1 fcompuman,lawrence,[email protected] fcompuman,[email protected] [email protected] NEC Research Institute1 Artificial Intelligence Laboratory2 Business Administration3 4 Independence Way University of Michigan University of Michigan Princeton, NJ 0854
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The KIPS Transactions:PartD
سال: 2005
ISSN: 1598-2866
DOI: 10.3745/kipstd.2005.12d.5.683